Project-Team:BIPOP

Inria | Raweb 2012 | Presentation of the Project-Team BIPOP | BIPOP Web Site


	PDF	e-Pub

Previous |

Home | Next next

Section: New Results

Optimization

Optimization algorithms for large-scale machine learning problems, and applications in computer vision

Participant : Jérôme Malick.

This collaboration with Zaid Harchaoui (Inria, LEAR Team) has been growing since summer 2010. It also involves Miro Dudik (Microsoft Research NYC) and a student who just started his PhD in october 2012 (after his master with us).

The explosion of data that we are experiencing (Big Data) lead us to huge-scale learning problems, new challenges for statistical learning and numerical optimisation algorithms. For example, the new databases for image categorization are large-scale in the three dimensions (large number of exemples, high-dimension feature description, and large number of categories). The resulting learning problem is out of reach by standard optimization problems.

We developped a new approach exploiting the hidden underlying lower-dimension structure of this big data. We proposed a new family of algorithms (of the type coordinate results, or conditional gadient), whose iterations have an algorithmic complexity lower that an order compared to standard methods. For example, applied to learning problems with trace-norm penalization, our algorithm [26] exploit the atomic decomposition of the norm and compute only an approximate largest singular vector pair (instead of the whole singular value decomposition). Promising results [27] have been obtained on the image database Imagenet, where we show significant improvements compare to the state-of-the-art approaches (one-vs-rest strategies).

Semidefinite programming and combinatorial optimization

Participants : Nathan Krislock, Jérôme Malick.

We have worked with Frederic Roupin (Prof. at Paris XIII) on the use of semidefinite programming to solve combinatorial optimization problems to optimality. Within exact resolution schemes (branch-and-bound), “good” bounds are those with a “good” balance between tightness and computing times.

We proposed a new family of semidefinite bounds for 0-1 quadratic problems with linear or quadratic constraints [50] . The paradigm is to trade computing time for a (small) deterioration of the quality of the usual semidefinite bounds, in view of enhancing this efficiency in exact resolution schemes. Extensive numerical comparisons et tests showed the superior quality of our bounds, when embedded within branch-and-bound shemes, on standard test-problems (unconstrained 0-1 quadratic problems, heaviest k-subgraph problems, and graph bisection problems).

We have embedded the new bounds within branch-and-bound algorithms to solve 2 standard combinatorial optimization problems to optimality.

Max-cut. We developed [34] an improved bounding procedure obtained by reducing two key parameters (the target level of accuracy and the stopping tolerance of the inner Quasi-Newton engine) to zero, and iteratively adding triangle inequality cuts. We also precisely analyzed its theoretical convergence properties. We show that our method outperform the state-of-the-art solver ( [52] ) on the large test-problems.
Heaviest k-subgraph problems. Our previous work [51] takes advantage of the new bounds in their basic form to prune very well in the search tree. Its performances are then comparable with the best method (based on convex quadratic relaxation using CPLEX as an engine). Adapting and incorporating the tehniques we developped for the max-cut problem, we propose in [35] an big improveement of the first algorithm (up to 10 times faster). For the first time, we were able to solve exactly k-cluster instances of size 160. In practice, our method works particularly fine on the most difficult instances (with a large number of vertices, small density and small k).

Finally, we have worked on making our data sets available online together with a web interface for our solvers. We have also started working on a generic online semidefinite-based solver for binary quadratic problems using the generality of [50] . All this is publicly available on line at http://lipn.univ-paris13.fr/BiqCrunch/ .

Unified theory of inaccurate bundle methods

Participants : Claude Lemaréchal, Welington Oliveira.

Convergence of bundle methods is an intricate subject. The situation is even worse in the inexact case, where many variants exist, each with its specific ad hoc proof techniques.

With C. Sagastizábal (Rio de Janeiro), we have developed a synthetic theory to single out the successive steps when proving convergence of a generic algorithm, as well as the specific hypotheses that they need. Our pattern covers all variants published so far and suggests a new one. The corresponding paper is being finalized.

Stabilizing marginal prices in electricity production

Participants : Claude Lemaréchal, Jérôme Malick, Sofia Zaourar.

Unit-commitment optimization problems in electricity production are large-scale, nonconvex and heterogeneous, but they are decomposable by Lagrangian duality. Realistic modeling of technical production constraints makes the dual objective function computed inexactly though. An inexact version of the bundle method has been dedicated to tackle this difficulty [48] . However, the computed optimal dual variables show a noisy and unstable behaviour, that could prevent their use as price indicator. We propose a simple and controllable way to stabilize the dual optimal solutions, by penalizing the total variation of the prices [36] . Our illustrations on the daily electricity production optimization of EDF show a strinking stabilization at a negligible cost.

Previous |

Home | Next next